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ABSTRACT 

This  research  work  is  a  continuation  of  the  study  that  was  previously  made  on  the  association  of  sandalwood 
with  neighboring  species  in  Pachaimalai  hills,  Eastern  Ghats,  Tamilnadu,  India.  We  add  a  regression  model  through 
reliability  of  estimates  to  the  procedure  in  order  to  predict  one  data  instance  from  contributing  more  than  others. 
We  also  utilize  statistical  analysis  for  the  data  processing  and  the  experimental  results  show  a  very  high  utility  of 
modeling  plant  association  problems.  The  reliability  of  estimates  and  regression  analysis  done  in  the  work  shows  close 
associations  and  relationships  between  two  plant  species  clustering  around  Sandalwood  in  almost  all  the  area  under 
study  in  Pachaimalai  hills. 
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INTRODUCTION 

Pachaimalai,  also  known  as  the  Pachais  (Sonia  et  al.,  2015),  are  hills  which  are  part  of  Eastern  Ghats  in 
Tamil  Nadu  in  Trichy  district,  located  near  11°11'N  78°21'E  /  11.18°N  78.35°E  /  11.18;  78.35.  They  are  much 
greener  than  some  of  the  other  hills  in  the  vicinity.  Extending  over  an  area  of  about  5,200  square  miles  (13,500 
square  km),  they  form  a  discontinuous  line  of  highlands  with  a  general  elevation  from  1,770  to  4,620  feet  (540  to 
1,400  meters).  The  economy  of  the  region  is  based  on  agriculture;  rice,  jowar  (grain  sorghum),  sugarcane,  gram 
(chickpeas),  peanuts  (groundnuts),  and  bajra  (pearl  millet)  are  subsistence  crops.  Coffee,  cashews,  and  pepper  are 
important  plantation  crops  raised  for  export.  Pachaimalai  is  known  for  its  thick  vegetation  of  Sandalwood  and 
bamboo.  Santalum  album  or  Indian  sandalwood  is  a  small  tropical  tree,  and  is  the  most  commonly  known  source  of 
sandalwood.  This  species  has  historically  been  cultivated,  processed  and  traded  since  ancient  times.  Certain  cultures 
place  great  significance  on  its  fragrant  and  medicinal  qualities.  The  high  value  of  the  species  has  caused  its  past 
exploitation,  to  the  point  where  the  wild  population  is  vulnerable  to  extinction,  especially  in  Pachaimalai  hills 
known  for  its  rich  vegetation  in  sandalwood.  For  a  sandalwood  tree,  a  fruit  is  produced  after  three  years  and  viable 
seeds  after  five  years  and  these  seeds  are  distributed  by  birds.  Plant  association  is  defined  as  the  grouping  of  plant 
species,  or  a  plant  community,  that  recurs  across  the  landscape.  Plant  associations  are  used  as  indicators  of 
environmental  conditions  such  as  temperature,  moisture,  light  etc.  It  can  be  viewed  as  a  collection  of  plant  species 
within  a  designated  geographical  unit,  which  forms  a  relatively  uniform  patch,  distinguishable  from  neighboring 
patches  of  different  vegetation  types  (Willner,  2006;  Yang  et  al.,  2010;  Schleicher  et  al.,  2011).  The  components  of 
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each  plant  community  are  influenced  by  soil  type,  topography,  climate  and  human  disturbance.  In  many  cases,  there  are 
several  soil  types  within  a  given  phytocoenosis.  Many  researchers  have  applied  data-mining  and  recent  advancements  in 
various  environmental  situations  in  different  parts  of  the  globe  (Andreopoulou  et  al.,  2011;  Chaudry,  2013;  Rojas-Mora  et 
al.,  2013;  Selmaui-Folcher  et  al.,  2011).  By  using  the  fuzzy  correlation  analysis,  fuzzy  partial  correlation,  fuzzy  semi- 
partial  correlation  and  fuzzy  multiple  correlation  rules  for  fuzzy  numbers  are  generated  to  see  that  two  fuzzy  sets  not  only 
frequently  occur  together  in  same  records,  but  also  are  related  to  each  other.  Correlation  measures  and  data  mining 
algorithm  with  numerical  results  were  discussed  in  detail  in  our  earlier  work  (Sonia  et  al.,  2015).  In  this  paper,  we  attempt 
to  find  some  useful  relationships  through  reliability  of  estimates  in  theregression  analysis  for  the  same  numerical  results  of 
the  species  clustering  around  sandalwood  in  Pachaimalai  hills. 


MATERIALS  AND  METHODS 

Since  the  distribution  of  sandalwood  in  the  Pachaimalai  hills  is  not  even  or  unpredictable,  surveying  the  frequency 
of  plant  associations  with  sandalwood  is  a  tedious  job.  Hence  we  identified  twelve  potential  spots  in  the  hills  where  the 
distribution  of  the  same  is  considered  notable  and  the  surveyed  frequency  data  is  converted  into  fuzzy  numbers  due  to  the 
irregular  distribution  of  the  plant  communities.  Four  different  plant  species  are  found  to  be  closely  associated  with 
sandalwood  and  they  seem  to  recur  in  all  the  surveyed  spots.  In  this  paper,  we  utilize  bio-statistical  rules  and  determine  the 
association  between  sandalwood  with  the  identified  four  other  plant  species.  Fuzzy  correlation  rules,  fuzzy  partial 
correlation  rules,  fuzzy  semi-partial  correlation  rules,  fuzzy  multiple  correlation  rules  and  fuzzy  regression  analysis 
(Han  &Kamber,  2012;  Robinson  et  al.,  2012;  Agrawal  et  al.,  1993)  are  used  for  identifying  the  plant  associations. 

Case  Study  of  Plant  Association  with  Sandalwood  in  Pachaimalai  Hills 

Assume  that  T  ={t1,t1,t3,t4,t5,t6,t1,ts,t9,t10,tn,tv}  is  a  random  sample  with  12  fuzzy  records  shown  in  Table  1, 

and  F  ={/j,/2,/3,/4,/5}is  the  set  of  observed  fuzzy  items.  In  this  study,  we  observe  five  plant  species  occurring 

frequently  almost  in  all  transactions  of  a  botanical  survey  conducted  in  Pachaimalai  hills,  the  Eastern  Ghats  for  analyzing 
plant  associations  for  sandalwood.  The  four  plant  species  found  to  be  distributed  around  sandalwood  in  the  hills  are  Acacia 
pennata,  Celtisphilippensis,  Zizyphusoenoplia,  and  Miliusaeriocarpa. 


Table  1:  A  Random  Sample  with  12  Fuzzy  Records 


X 

F, 

Santalum  album 

f2 

Acacia  Pennata 

f3 

Celtisphilippensis 

f4 

Zizyphusoenoplia 

Fs 

Miliusaeriocarpa 

tl 

0.78 

0.71 

0.65 

0.6 

0.56 

t2 

0.98 

0.89 

0.82 

0.75 

0.7 

t3 

0.56 

0.51 

0.47 

0.43 

0.4 

t4 

0.66 

0.6 

0.55 

0.51 

0.48 

t5 

0.4 

0.36 

0.33 

0.31 

0.29 

t6 

0.16 

0.15 

0.13 

0.12 

0.11 

t7 

0.34 

0.31 

0.28 

0.26 

0.24 

t8 

0.9 

0.82 

0.75 

0.69 

0.64 

t9 

0.22 

0.2 

0.18 

0.17 

0.16 

ti» 

0.64 

0.58 

0.53 

0.49 

0.46 

ti, 

0.3 

0.27 

0.25 

0.23 

0.21 

1 12 

0.48 

0.44 

0.4 

0.37 

0.34 

Impact  Factor  (JCC):  3.9996 


NAAS  Rating:  4.08 
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The  data  presented  in  table  1  is  generated  using  the  following  fuzzy  membership  functions  for  the  frequency  of 
the  five  different  plant  species  occurring  together  frequently. 


Acacia  Pennata  Celtis  philippensis  Zizyphus  oenoplia  Miliusa  eriocarpa 


Figure  1:  Plant  Species  Clustering  Around  Sandalwood  in  Pachaimalai  Hills 

fl-f2-f3-f4-f5 


Figure  2:  Three  Dimensional  Interpretation  of  the  Comparison  of  the  Plant  Species 

The  fuzzy  correlation  coefficient  is  :  r,  ,  =0.98,  rf  ,  =0.99,  r,  ,  =0.95. 

J\>J2  /W3  J2  ’73 

Partial  correlation  is  obtained  as,  rx  p3)  =  0.90,  f  (32j  =  0.95,  r2  pij  =  -0.72. 

Semi-partial  correlation  is  obtained  as, 

rr,(2-3)  =  0- 13’  ri,(3-2)=0-19’  r2,(3D=-°A4’  ^2,(1 3)  ~  0.28,  ^  =  0.30,  = -0.10. 

This  multiple  correlation  coefficient  is  obtained  as,  R](2  3)  =  0.9982,  R2(]  3)  =  0.9902,  R3n  2)  =  0.9957. 


REGRESSION  ANALYSIS 


Let  us  define  CO  = 


,2  2 


rn 

ri2 

ri3 

r2i 

r22 

r23 

r3i 

r32 

r33 

+  2r 
^  ^'12 

r23r 

Where  f ,  —  r22  —  L3  —  1  •  Expanding  we  get 


Let  COi  be  the  cofactor  of  r . ,  C0l , 


r22 

r23 

S 

to 

II 

1 

r2l 

r23 

II 

1 

r2l 

r23 

r32 

V33 

r3l 

r33 

r3l 

r32 

The  equation  of  the  plane  of  regression  referred  to  as  original  origin  is  Y,  -  A,  ^  X2-X2  ^  X3-X3  ^  _  n 

wj  j  I  vXJj 2  vdj 3  O 
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Here, 


(0.1  = 


1  0.95 

0.95  1 


=  0.0975,  (ol2  =  - 


0.98  0.95 
0.99  1 


-0.04,  to.3  =  - 


1  0.98 

0.99  0.95 


=  0.02 


Expanding  we  get,  (o  =  1  -  (0.98)2  -  (0.95)2  -  (0.99)2  +  2(0.98)(0.95)(0.99)  =  0.00038 


L*r 


Now  <72  =  -= — ! —  Xi  —  0.059,  cr22  =^^-Z2  =0.0499,  a\  =  ^^-Z3'  =0.0409. 


1*3 


Since  X,  =  0.54, Z2  =  0.49, Z3  =  0.45,  now,*'  °'54 (0.0975)+  *2  °49 (-0.04)  +  °'45 (0.02)  =  0. 


0.24 


0.22 


0.2 


X,  =0.43 14  +  0. 448Z2 —0.2462Z3  is  the  equation  of  regression  plane  of  A.  on  A0  and  X 


RELIABILITY  OL  THE  ESTIMATES 

As  for  the  two  variable  case,  a  measure  of  the  reliability  of  the  estimates  of  the  dependent  variable  A.  lies  in  the 
deviations  of  observed  A,  values  from  the  estimated  A.c  values  based  on  the  regression  equation  of  A.  on  X 1  and  A3  , 
as  in  above.  These  deviations  (A,  —  A1C  )  are  given  in  the  table  below: 


Table  2:  Estimated  Values  for  Regression  for  Dependent  *i 


*2 

*3 

A1C 

(x,-xlc) 

(x,-xlc)! 

0.78 

0.71 

0.65 

0.58945 

0.19055 

0.0363 

0.98 

0.89 

0.82 

0.628236 

0.351764 

0.1237 

0.56 

0.51 

0.47 

0.544166 

0.015834 

0.00025 

0.66 

0.6 

0.55 

0.56479 

0.09521 

0.00906 

0.4 

0.36 

0.33 

0.511434 

-0.111434 

0.01242 

0.16 

0.15 

0.13 

0.466594 

-0.306594 

0.094 

0.34 

0.31 

0.28 

0.501344 

0.161344 

0.026 

0.9 

0.82 

0.75 

0.61411 

0.28589 

0.0817 

0.22 

0.2 

0.18 

0.476684 

-0.256684 

0.0659 

0.64 

0.58 

0.53 

0.560754 

0.079246 

0.0063 

0.3 

0.27 

0.25 

0.049081 

0.19081 

0.0364 

0.48 

0.44 

0.4 

0.53004 

-0.05004 

0.0025 

Here  "S'  (Z.  —  Zlc )'  =  0.49453.  A  measure  of  the  reliability  of  the  estimates  of  the  dependent  variable  X 1  lies  in 
the  deviations  of  observed  A0  values  from  the  estimated  X1C  values  based  on  the  regression  equation  of  A0  on  A,  and 
a3  ,as  in  above.  These  deviations  ( X2  —  Xlc )  are  given  in  the  table  below: 
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Table  3:  Estimated  Values  for  Regression  for  Dependent  X2 


*2 

*3 

X2C 

{X2-X2C) 

(*3-X2c)! 

0.78 

0.71 

0.65 

1.133575 

-0.423575 

0.1794 

0.98 

0.89 

0.82 

1.67299 

-0.78299 

0.613 

0.56 

0.51 

0.47 

0.544065 

-0.034065 

0.00116 

0.66 

0.6 

0.55 

0.811025 

-0.151025 

0.0228 

0.4 

0.36 

0.33 

0.110335 

0.289665 

0.0839 

0.16 

0.15 

0.13 

-0.534765 

-0.278235 

0.0774 

0.34 

0.31 

0.28 

-0.05094 

0.39094 

0.15283 

0.9 

0.82 

0.75 

1.456125 

-0.636125 

0.4047 

0.22 

0.2 

0.18 

-0.37349 

0.57349 

0.3289 

0.64 

0.58 

0.53 

0.755435 

-0.175435 

0.0308 

0.3 

0.27 

0.25 

-0.156625 

0.426625 

0.182 

0.48 

0.44 

0.4 

0.3272 

0.1128 

0.0127 

X(X2-X2c)2  =2.08959. 


A  measure  of  thereliability  of  the  estimates  of  the  dependent  variable 
values  from  the  estimated  X3C  values  based  on  the  regression  equation  of 
deviations  (  X3  —  X3C  )  are  given  in  the  table  below: 


X^  lies  in  the  deviations  of  observed  X2 
x3  on  X t  and  X  2 ,  as  in  above.  These 


Table  4:  Estimated  values  for  regression  for  dependent  X, 


x2 

*3 

x3C 

(x3-xsc) 

0.78 

0.71 

0.65 

-0.1238 

0.7738 

0.5988 

0.98 

0.89 

0.82 

-0.6087 

1.4287 

2.0412 

0.56 

0.51 

0.47 

0.40595 

0.06405 

0.0041 

0.66 

0.6 

0.55 

0.1635 

0.3865 

0.1494 

0.4 

0.36 

0.33 

0.78295 

-0.45295 

0.2052 

0.16 

0.15 

0.13 

1.37575 

-1.24575 

1.5519 

0.34 

0.31 

0.28 

0.9357 

-0.6557 

0.4299 

0.9 

0.82 

0.75 

-0.4111 

1.1611 

1.3482 

0.22 

0.2 

0.18 

1.223 

-1.043 

1.0878 

0.64 

0.58 

0.53 

0.20835 

0.32165 

0.1035 

0.3 

0.27 

0.25 

1.0254 

-0.7754 

0.6012 

0.48 

0.44 

0.4 

0.60355 

-0.20355 

0.0414 

X  (X3 -Xxf  =2.08959. 


The  standard  error  of  estimate,  now  denoted  as  is  obtained  from  the  sum  of  the  squared  deviations  as 


Z(x-x< 

123  N 


...  IZ(x-xc)2  s  = 

N  ’  312  V  N 


An  alternate  method  of  computation  of  S]  23  in  terms  of  simple  correlation  coefficients  ri2>  rn  and  r23  i 


is 


*^1.23  —  *^1 


( 1  -  4 ) 


1  r[2  A.  r23  -fAA-. .  g  |1  rn  f  A  ^  j  1  rn  rn  ri3  f  -h2h3r23  As  before, 


(1_ru) 


('-4) 
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represents  the  average  difference  between  the  observed  values  and  the  corresponding  estimated  Xlc  values. 
The  only  difference  being  that  Xlc  values  are  now  computed  on  the  basis  of  the  linear  regression  equations  with  Xf  as 
the  dependent  variable  and  X2  and  X.,  as  two  independent  variables.  The  subscript  1  before  the  decimal  point  signifies 
the  dependent  variable  a)  .and  the  subscript  2  and  3  after  the  decimal  point  signify  the  two  independent  variables  X n  and 

*3- 


When  computing  the  standard  error  of  an  estimate  S1  based  on  the  data  given  in  the  table  comes  to: 


i, ,  - - 0.203:  5, „  - ft,  1-0.%04-0.9801-0.902^2(0.08)(0.00)(0.05)  _ 
'  12  V  1-0.9025 


Similarlly,  y  _ 


2.08959 


12 


=  0.4173;  5,  l3  =0.24 


1-0. 9604 -0.9801 -0.9025 +  2  (0.98)  (0.99)  (0.95) 


1-0.9801 


=  0.0332 


and 


1.1625 


12 


=  0.8247;  5, 


=  0.22. 


1-0. 9604 -0.9801 -0.9025 +  2  (0.98)  (0.99)  (0.95) 
1-0.9604 


=  0.0216. 


Least  Square  Method  for  Two  Variable  Cases 

In  the  method  of  least  square,  the  values  of  a  and  b  are  obtained  by  solving  simultaneously  the  following  pair  of 
normal  equations  ^  Y  =  aN  +  bJ'X;  ^  XY  -a^X  +/;^  X1  in  which  N  is  the  total  number  of  paired  observations  in 

the  sample.  The  method  of  least  squares  seeks  to  minimize  w  =  £(T-rc)2;  w=£(y-a  -foX)2,  with  respect  to  a  and  b. 

Where  and  x  XY-NXY 

N^X'-CZxf  ~YjX1-NX1' 


A  measure  of  the  error  of  estimate  is  given  by  the  standard  error  of  estimate  of  Y  on  X,  denoted  as  SY  x  and 

[77  7777 

defined  as 


$r.x  ~ 


r 


( Y~Ycf 


N 


Table  5:  Estimated  Values  for  Least  Square  Regression 


S.  No 

*1 

*2 

*3 

*!2 

*2 

*3 

*1*2 

*2*3 

*3*1 

1 

0.78 

0.71 

0.65 

0.6084 

0.5041 

0.4225 

0.5538 

0.4615 

0.507 

2 

0.98 

0.89 

0.82 

0.9604 

0.7921 

0.6724 

0.8722 

0.7298 

0.8036 

3 

0.56 

0.51 

0.47 

0.3136 

0.2601 

0.2209 

0.2856 

0.2397 

0.2632 

4 

0.66 

0.6 

0.55 

0.4356 

0.36 

0.3025 

0.396 

0.33 

0.363 

5 

0.4 

0.36 

0.33 

0.16 

0.1296 

0.1089 

0.144 

0.1188 

0.132 

6 

0.16 

0.15 

0.13 

0.0256 

0.0225 

0.0169 

0.024 

0.0195 

0.0208 

7 

0.34 

0.31 

0.28 

0.1156 

0.0961 

0.0784 

0.1054 

0.0868 

0.0952 

8 

0.9 

0.82 

0.75 

0.81 

0.6724 

0.5625 

0.738 

0.615 

0.675 

9 

0.22 

0.2 

0.18 

0.0484 

0.04 

0.0324 

0.044 

0.036 

0.0396 

10 

0.64 

0.58 

0.53 

0.4096 

0.3364 

0.2809 

0.3712 

0.3074 

0.3392 

11 

0.3 

0.27 

0.25 

0.09 

0.0729 

0.0625 

0.081 

0.0675 

0.075 

12 

0.48 

0.44 

0.4 

0.2304 

0.1936 

0.16 

0.2112 

0.176 

0.192 

I 

6.42 

5.84 

5.34 

4.2076 

3.4798 

2.9208 

3.8264 

3.188 

3.5056 

Impact  Factor  (JCC):  3.9996 
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The  regression  equation  of  X{  on  x2  is  XiC  —  ci  +Z?^2* 

Here  ^ _  Z I X2  ~Z *, I xtX2  _  (6.42)(3.4798) -(6-42X3.8264)  =_QO908 
NZX22-{HX2f  12(3.4798) -34. 1056 

bJLX\X2~XX\X2  =  3-8264 -(12)(0.49)(0.54)  =  Hence,  v  =  -  0.2908 +  1.0879X 
XX22-AX22  3.4798 -12(0.49)2 


Table  6:  Estimated  Values  for  Least  Square  Regression  for  Dependent  X, 


S.  No 

*1 

*2 

Xic 

(X,-X1C) 

(x,-x,c)! 

1 

0.78 

0.71 

0.481609 

0.298391 

0.089 

2 

0.98 

0.89 

0.677431 

0.302569 

0.0915 

3 

0.56 

0.51 

0.264029 

0.295971 

0.0876 

4 

0.66 

0.6 

0.36194 

0.29806 

0.0888 

5 

0.4 

0.36 

0.100844 

0.299156 

0.0895 

6 

0.16 

0.15 

-0.127615 

0.287615 

0.0827 

7 

0.34 

0.31 

0.046449 

0.293551 

0.0862 

8 

0.9 

0.82 

0.601278 

0.298722 

0.0892 

9 

0.22 

0.2 

-0.07322 

0.29322 

0.086 

10 

0.64 

0.58 

0.340182 

0.299818 

0.0899 

11 

0.3 

0.27 

0.002933 

0.297067 

0.0882 

12 

0.48 

0.44 

0.187876 

0.292124 

0.0853 

Z((X -Xc)2)  =1-0539-  sl2  =  ^  =  0-2964 

The  regression  equation  of  X2  on  x3  is  —  c  A  dX 2 

Here, c  =  -0.2388,  d  =  1.1043.  Hence X2C  =-0.2388  +  1.1043X3. 


Table  7:  Estimated  Values  for  Least  Square  Regression  for  Dependent  X, 


S.  No 

*2 

*3 

x2C 

(X2-X2C) 

(• x2-x2C )! 

1 

0.71 

0.65 

0.478995 

0.231005 

0.0534 

2 

0.89 

0.82 

0.666726 

0.223274 

0.0499 

3 

0.51 

0.47 

0.280221 

0.229779 

0.0528 

4 

0.6 

0.55 

0.368565 

0.231435 

0.0536 

5 

0.36 

0.33 

0.125619 

0.234381 

0.0549 

6 

0.15 

0.13 

-0.095241 

0.245241 

0.0601 

7 

0.31 

0.28 

0.070404 

0.239596 

0.0574 

8 

0.82 

0.75 

0.589425 

0.230575 

0.0532 

9 

0.2 

0.18 

-0.040026 

0.240026 

0.0576 

10 

0.58 

0.53 

0.346479 

0.233521 

0.0545 

11 

0.27 

0.25 

0.037275 

0.232725 

0.0542 

12 

0.44 

0.4 

0.20292 

0.23708 

0.0562 

Z((Z2-^2c)2)  =0-6578;  Su 


0.2341. 
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The  regression  equation  of  X3  is  X3C  —e  +  fX2 

Here,  e  =  0.4042,  /  =0.8323.  Hence  X3C  =  0.4042 -0.8323X,. 


Table  8:  Estimated  Values  for  Least  Square  Regression  for  Dependent  X3 


S.  No 

*3 

*3c 

(N-Nc) 

(x3-xJC)2 

1 

0.65 

0.78 

1.053394 

-0.403394 

0.1627 

2 

0.82 

0.98 

1.219854 

-0.399854 

0.1599 

3 

0.47 

0.56 

0.870288 

-0.4003518 

0.1602 

4 

0.55 

0.66 

0.953518 

-0.403518 

0.1628 

5 

0.33 

0.4 

0.73712 

-0.40712 

0.1657 

6 

0.13 

0.16 

0.537368 

-0.407368 

0.1659 

7 

0.28 

0.34 

0.687182 

-0.407182 

0.1658 

8 

0.75 

0.9 

1.15327 

-0.40327 

0.1626 

9 

0.18 

0.22 

0.587306 

-0.407306 

0.1659 

10 

0.53 

0.64 

0.936872 

-0.406872 

0.1655 

11 

0.25 

0.3 

0.65389 

-0.40389 

0.1631 

12 

0.4 

0.48 

0.803704 

-0.403704 

0.163 

X((*3-*3C)2)  =  1.9631;  S3A=^(X3nX^ 


Comparison  of  Least  Square  Method  for  Two  Variable  Cases  with  Three  Variable  Cases 


Comparing  Sl  2  with  Sj  23 ,  we  notice  that  the  magnitude  of  the  variations  in  the  two  variable  case  is  more  than  in 


three  variable  case.  This  means  that  the  estimated  Xlc  values  are  closer  to  the  observed  values  Xl  when  the  estimating 
regression  equation  has  two  independent  variables  than  when  it  has  one.  This  difference  which  represents  increased 
accuracy  is  obviously  due  to  the  influence  of  the  second  independent  variable  X  v  In  other  words,  it  means  that  a  part  of 

the  variations  in  Xl  which  are  unexplained  by  X1  in  the  two  variable  cases  are  explained  by  the  additional  independent 
variable  X3  in  the  three  variable  cases.  If  the  net  influence  of  X  ,  were  zero,  it  will  make  no  difference  to  the  estimated 
Xlc  values  irrespective  of  X3  being  or  not  being  in  the  estimating  equation.  But  as  long  as  there  is  a  correlation  between 
the  dependent  variable  X{  and  any  additional  independent  variable,  the  inclusion  of  the  same  in  the  regression  estimating 
equation  will  reduce  the  error  in  the  estimates  of  X{C  and  improve  their  accuracy. 

DISCUSSIONS 

A  random  sample  T  —{t1,t2,t3,t4,t5,t6,t1,ts,t9,t10,tn,tv}  with  12  fuzzy  records  shown  in  Table  1  was  initially 
taken  at  the  beginning  of  this  study,  and  F  =  {fl,f2,f3,f4,f5}  was  the  set  of  observed  fuzzy  items  which  is  the  plant 

species  mostly  occurring  in  places  where  sandalwood  is  found.  Fuzzy  regression  was  computed,  considering  the  three 
variables  {/,,/,,  /}.  The  whole  study  reveals  a  close  relation  and  association  between  the  variables  in  the  set  /3}. 
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It  is  identified  that  the  plant  species /j, /2,  and/3  are  the  most  frequently  occurring  among  the  five  total  species  under  study 
(i Santalum  album  -  Acacia  pennata  -  Celtisphilippensis).  It  is  seen  that  the  plant  species  Acacia  pennata  and 
Celtisphilippensisare  very  much  closely  associated  than  the  other  plant  species  identified  in  this  study.  A  brief  description 
of  the  botanical  characteristics  of  the  above-mentioned  plant  species  is  given  as  follows. 

BOTANICAL  NAME:  ACACIA  PENNETTA  (F2) 

Family:  Miniosaccac(Touch-me-not  Family) 

Climbing  Acacia  is  a  perennial  climbing  shrub  or  a  small  tree.  The  stem  is  thorny.  Young  branches  are  pubescent, 
green  in  color  and  turn  brown  with  age.  A  large  gland  is  present  on  the  main  spine  of  leaves  above  the  middle  of  the 
petiole.  Leaves  are  double-compound,  pinnae  8-18  pairs,  leaflets  linear-oblong,  smooth,  base  oblique  or  truncate,  up  to  50 
pairs  per  pinna,  hairy  on  the  margins  loosely  set  and  overlapping.  Flowers  are  in  large  panicles  at  the  end  of  branches. 
They  are  spherical,  pale  yellow  flower-  heads.  The  pods  are  thin,  flat  and  long  with  thick  sutures. 

Medicinal  uses:  In  India,  leaf  juice  mixed  with  milk  is  used  for  the  treatment  of  indigestion  in  infants.  It  is  also 
used  for  scalding  of  urine  and  for  curing  bleeding  gums.  Some  people  use  boiled  tender  leaves  for  cholera  treatment, 
digestive  complaints,  relief  of  a  headache,  body  pain,  snake  bites,  and  even  to  cure  fish  poisoning.  The  root  can  be  used 
for  inducing  flatulence  and  to  cure  stomach  pain.  The  bark  is  used  for  the  treatment  of  bronchitis,  asthma  and  for  stomach 
complaints. 

BOTANICAL  NAME:  CELTISPHILIPPENSIS  (F3) 

Family:  Cannabaceae 

Trees  grow  to  about  12  m  tall.  The  bark  is  usually  grey,  lenticellate  and  blaze  brownish.  The  young  branchlets  are 
angular  and  pubescent.  The  leaves  are  simple,  alternate  and  distichous;  stipules  lateral,  pubescent,  caducous  and 
leaving  scar;  The  petiole  is  1  to  1.5  cm  long,  canaliculated  above  and  pubescent  when  young.  The  lamina  is  4.5  to  11.5  cm 
and  elliptic  to  ellipticovate,  the  apex  acute  to  acuminate,  the  base  rounded  or  acute,  sometimes  asymmetric,  the  margin 
being  entire.  The  flowers  are  inflorescence  axillary  cymes,  polygamous  and  usually  appear  on  new  branchlets.  The  fruit  is 
a  drupe  type  up  to  0.8  cm  long  and  has  one  seed. 

Economic  Importance:  The  native  Malayalis  tribe  people  of  Pachaimalai  hills  use  the  tree  for  firewood  and  for 
building  construction  purposes. 

CONCLUSIONS 

The  proposed  research  work  has  dealt  with  various  concepts  related  to  fuzzy  bio-statistical  rules  including 
regression  analysis  and  reliability  estimates  to  identify  closely  related  plant  species  around  sandalwood  in  the  Pachaimalai 
hills,  Tamilnadu,  India.  A  roadmap  to  different  correlation  measures  and  regression  analysis  were  clearly  displayed  in  the 
numerical  case  study.  The  study  revealed,  after  careful  examination  and  computations,  that  two  out  of  the  four  plant 
species  are  found  to  be  very  closely  associated  with  sandalwood  in  Pachaimalai  hills. 
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